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ABSTRACT 

I 

Suppose  Che  basic  shape  of  Che  cumulaCive  failure  (hazard) 
funcclon  has  been  Idenclfled  for  a cercaln  componenC,  and  . 
chac  an  unknown  paratneccr  for  a new  produccion  run  of 
similar  componencs  is  Co  be  escimaccd.  In  parCicular, 

'&Qf»PQse  chac  Che  failure  funcclon  is  of  proportional  Cype, 
R(x)  “(3q(x)  , where  Q is  Che  known  shape  funcCion,  and 
chac  ^0)  is  sampled  from  a prior  gamma  denslcy.  By  using 
a new  scacictlc,  called  Che  total  Q on  teat  (TQT) , iC  is 
possible  CO  perform  Bayesian  updaclng  during  a variety  of 
llfeClme  testing  programs  in  a manner  similar  to  total  time 
on  test  plots.  This  statistic  can  also  be  used  with  com- 
plete lifetime  data,  extending  over  several  product  runs, 

CO  identify  the  failure  form  Q , and  to  estimate  the  gamma 
hyperparameters.  Extensions  Include  the  use  of  several 
TQT  statistics  to  estimate  the  relative  strength  of  compet- 
ing hazard  functions. 
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BAYESIAN  LIFE  TESTING  USING  THE  TOTAL  Q ON  TEST 


by 

William  S.  Jewell 


1.  INTRODUCTION 

Suppose  that  the  random  lifetime,  jc  , of  a certain  component  type 
has  a distribution  function  P(x  [ 0)  , where  0 is  an  unknown  parameter 
that  may  vary  from  one  production  run  to  another,  but  is  assumed  constant 
within  each  run.  We  assume  a prior  distribution  of  interbatch  variation, 
P(0)  , is  available. 

There  are  two  main  types  of  estimation  problems  within  the  Bayesian 
framework: 

(1)  Model  identification,  in  which  the  forms  of  P(x  | 6)  and 
P(0)  are  to  be  Inferred  from  previous  production  runs; 

(2)  Parameter  estimation , in  which  selected  components  from  a 
given  batch  are  operated  under  normal  or  accelerated  test 
conditions  to  infer  the  particular  value  of  0 for  that 
batch. 

These  two  problems  are  interrelated,  since  the  choice  of  model  is  often 
made  to  simplify  parameter  estimation,  and  the  success  of  Bayesian  up- 
dating depends  upon  how  close  the  assumed  models  are  to  the  true  ones. 
However,  a third  factor  which  influences  both  of  these  tasks,  and  which 
is  often  overlooked,  is  the  test  design. 

In  the  following,  we  shall  examine  Bayesian  estimation  under  normal, 
but  Incomplete  test  conditions.  We  first  show  that  a convenient  modeling 
family  is  the  proportional  hazard  class  (with  generalization  to  competing 


hazard  families),  and  tn«  , with  a certain  natural  conjugate  prior, 
parameter  estimation  Is  easily  carried  out  through  a test  statistic 
we  call  "total  Q on  test,"  a generalization  of  the  concept  of  "total 
time  on  test."  Furthermore,  model  Identification  can  be  simplified 
by  using  special  total  Q on  teat  plots  of  the  sample  data. 
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2.  INCOMPLETK  LIFE  TESTS 

The  clnsslcnl  manner  In  which  lifetime  distributions  are  inferred 
is  to  place  N components,  assumed  to  have  the  same  values  of  6 , in  a 
test  environment  which  matches  normal  operating  conditions  for  T hours, 
or  until  all  components  have  all  failed,  if  earlier.  Normally,  the 
economics  of  testing  arc  such  that  the  (random)  number  which  have  failed, 
C(T)  , is  much  less  than  N . (See,  e.g.,  [1]  where  this  scheme  is  called  a 
truncated  data  test,  and  (5],  where  it  is  a Type  I censo.'ed  test).  The 
outcomes  of  the  incomplete  test  can  be  described  in  terms  of  a data  set 
V of  the  ordered  random  lifetimes  random  C(T)  as 

fol lows : 


If  p(x  I 0)  and  P*"(x  | 0)  are  the  density  and  complementary  (call) 
distribution,  respectively,  corresponding  to  P(x  | 6)  , then  the  likelihood 
of  the  data  set  V is: 


l(V  I 6)  - p(Xj^,X2»  ...,  Xjj  ; C(T)  [ 0) 

C 

(2.2)  - Tn-C)'!'  " pW  I I 0)] 


where  unit  impulse  (degenerate)  densities  6(x) 
N 

I J/UP  I 0)dx  dx»  •••  dx  =>  1 . 

C-0 


N 

n 6(x  - T) 
i-C+1 

(x^  < X2  < • • • ^ T) 


are  used  for  convenience 


so  that 
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An  alternate  testing  scheme  (called  a censored  data  test  in  [1]  and 
a Type  II  censored  test  in  [5]),  fixes  the  number  of  failures,  C , in 
advance,  and  thus  the  duration  of  the  test  is  a ra.'dom  variable, 

T(C)  - likelihood  (2.2)  becomes  a density 


L(V  I 6)  - p(Xj^,Xj Xjj  I 0) 

^N-C  N 

6)  n 6(x.  - Xp)  . 

-*  1-C+l  ^ ^ 

(*1  ^ *2  - ’**  - 


with  //up  I 0)dXj^,dx2  •••  dXj^  ■ 1 . 


(2.3) 


*(N-C)  I P^*i  I 
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3.  PROPORTIONAL  HAZAIU)  FAMILY:  _ TOTAL  Q ON  TEST  AND  MAXIMUM 
LIKELIHOOD  ESTIMATORS 

Although  (2.2;  and  (2.3)  could,  In  principle,  be  calculated  for  any 
underlying  distribution,  it  is  desirable  to  pick  a family  which  is  both 
useful  from  a modeling  point  of  view,  and  simple  computationally. 

The  Koopman-Pitman-Darmois  exponential  family  has  extremely  useful  proper- 
ties in  regular  Bayesian  forecasting  (2]  (4);  but,  it  has  no  convenient 
integral  form  and  does  not  seem  useful  in  life  testing  applications. 

However,  every  complementary  distribution  function  can  be  written 
P*^(x  I 0)  “ exp(-  R(x  I 0))  , where  R(x  | 0)  is  the  cumulative  failure 
(hazard)  function,  given  0 ; a convenient  way  to  parametrize  this 
form  is  to  suppose  that  R(x  | 0)  = 0Q(x)  . Since  Q(x)  is  a monotone 
nondecreasing  function  which  contain  all  of  the  failure  shape  information 
(thus  preserving  the  properties  of  IFR,  IFRA,  etc.  [1]  for  all  0),  we 
could  call  it  the  unit-  or  pro  to  type -failure  function,  and  refer  to  the 
density  and  cumulative  distributions: 

(3.1)  P‘"(x  I 0)  = ; p(x  ] 0)  - 0q (x)e“®‘^^’‘^  ; 

as  the  proportional  hazard  family  (q(x)  = dQ(x)/dx)  . We  note  the 
following  properties: 

(a)  The  family  Includes  the  exponential,  Welbull  with  given  shape 
parameter,  and  Gumbel  distributions; 

(b)  A random  lifetime,  ST  , with  distribution  (3.1)  and  0 an 

integer  can  be  Interpreted  as  jc  = min(y',  .y*., , , 

where  Pr{'y^  > y}  ■ exp(-Q(y))  , for  all  i = 1,2,  ....  0 , 
which  suggests  a certain  physical  failure  model  for  this  family; 


I 


I 


I 


I 


A 


•I 
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(c)  It  may  be  reasonable  that  production  of  different  lots 
under  varying  conditions  maintains  the  same  shape  for 
the  hazard  function  and  changes  only  the  relative  Intensity 
of  failures. 


Most  Importantly,  this  family  greatly  simplifies  estimation  In  life- 
testing,  since  the  likelihood  (2.2)  becomes 

p(Xj^,X2,  ...»  j C I 6) 

C N (p— 6[ TQT (x. ,X2,  ...,  x-  t C) ] 

^ " 6(x  -T){e^e 

(Xj^  < X2  < •••  < x^) 


where  call 


C 

(3.3)  TQT(x^,X2,  ....  x^  ; C - C(T))  = I Q(x^)  + (N-C)Q(T) 


the  total  Q on  test,  a statistic  of  varying  dimensions  that  Is  a 
natural  generalization  of  the  total  time  on  test  concept  for  the  ex- 
ponential density,  Q(x)  * x [1]  . Corresponding  to  (2.3)  and  the  second 
testing  scheme,  we  obtain  a similar,  but  flxed-dlmenslonal  statistic: 

C 

(3.4)  TQT(Xj^,X2,  ...,  x^)  - I Q(x^)  + (N-C)Q(x^)  , 


and  a likelihood  similar  to  (3.2). 

Since  only  the  term  in  braces  in  (3.2)  is  a function  of  0 , It  is 
easy  to  show  that  the  maximum  likelihood  estimators  of  6 and  1/6  are: 


/V 


(3.5) 


e"^  - mle{(0)"M  - ; e - mle{?}  - 


TQT(^) 


which  coincides  with  known  results  for  the  exponential  density  under  the 
two  tests  [1]  [5] . 
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4.  OTHER  LIFE  TESTS 

A variety  of  other  life  tcating  schemes  also  use  appropriate  generali- 
zations of  the  total  Q on  test  statistic.  For  sxample,  If  N units 
are  placed  on  test,  but  a aet  of  fixed,  but  possibly  different  test  durations 
{T^}  are  given  for  all  units,  then  we  observe  ■ mlnfx^,T^}  . The 
likelihood  of  this  unordered  data  is: 


p(yi.y2*  •••.  yjj  ; c 1 e) 


(4.1) 


- n 

leC 


q(y^) 


n 6(y 


J 


-TQT(y  ,y,, 
e e ^ ^ 


O] 


(0 


< < ») 


where  C-{i  ...»  N)  and  C ■ ||C||  . The  total  Q 

on  test  Is 


N 

(4.2)  TQT(y  ,y  , . . . , y„  ; C)  - I Q(y  ) - \ Q(x  ) + [ Q(T  ) . 

^ ^ ” 1-1  ^ IeC  ^ jfC  J 


This  Is  called  testing  with  withdrawals  in  [1]. 

Another  testing  procedure  might  be  called  renewal  testing.  Suppose 
a component  Is  started  on  a single  test  stand  at  time  zero;  after  the 
first  failure,  a new  Item  Is  started;  and  so  on,  until  the  procedure 
terminates  at  a fixed  time  T , with  the  (c  + 1)®^  item  still  operative. 


The  likelihood  associated  with  V - - x^^,  ^2  " *2’ 

C ) 

I X J is  just: 

1-1 


* *C  " *C  * 


‘5+1 


> T 


C 1 _ — TQT(x. ,X2 , . . . , X-  ; C)  j 

(4.3)  p(x^,X2,  . . . , Xj,  ; C I 0)  - H q(Xj^)<0  e . 


1-1 
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C 

over  the  simplex  0 < ^ x < T , with  total  Q on  test 

■ 1-1  ^ ■ 

(4.4)  TQT(x  ,x, X.  ; C)  - I Q(x.)  + q(t  - I x J . 

^ 1-1  ^ \ 1-1  7 

The  generalization  to  stopping  with  the  failure,  multiple  test 

stands,  and  other  combined  censorlng-truncating-wlthdrawal-renewal 

test  schemes  should  now  be  obvious. 

In  fact,  with  the  proportional  hazard  family  It  Is  clear  that,  no 

matter  what  the  test  set-up,  one  needs  only  to  monitor  the  actual  (complete 

or  Incomplete)  lifetimes  (y^)  for  each  item,  and  the  (fixed  or  stochastic) 

number,  C , of  completed  tests.  Then  TQT  - ^ Q(yj)  and  (3.5)  and 

all  1 ^ 

the  Bayesian  formulae  of  the  next  section  always  hold. 
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5.  BAYESIAN  UPDATING  UNDER  LIFE  TESTING;  MIXED  DISTRIBUTIONS 

In  a Bayesian  formulation,  we  do  not  use  (3.5)  directly  because 
we  have  prior  Information  on  the  possible  values  of  6 , perhaps  through 
cumulative  data  on  previous  production  runs,  or.  If  one  Is  so  persuaded, 
from  sampling  expert  opinion.  To  simplify  the  application  of  Bayes' 
law.  It  Is  convenient  to  pick  a natural  conjugate  prior,  whose  form 
matches  that  In  the  braces  In  (3.2);  this  is  Just  the  gamma  density 


(5.1) 


P(6)  - 


C -1  -0Q 

Q„(Q„6)  0 . ° 

r(c^) 


(0  < 0 < ®) 


with  hyperparameters  C , Q • The  usefulness  of  (5.1)  In  modeling  unlmodal 

o 0 

densities  over  [0,»)  Is  well-known.  It  Is  easy  to  see  that  Bayes' 
law  then  gives  a posterlor-to-data  density  of  0 , p(0  | P)  , which  Is 
also  gamma,  but  with  updated  hyperparameters: 

C C + C , 

(5.2)  ° ° 

Qq  ^ Qo  + TQT(P)  • 


In  place  of  the  classical  estimators  (3.5),  we  have  the  posterior-to-data 
expectations 

(5.3)  E{(?)"^  I V)  - (1-Z^)E{'T^}  + z^e"^(p) 

(5.4)  [E{0  I V}]~^  - (1-Z2)[E{0}]'^  + Z2[0(P)]'^ 

with  "credibility  factors" 

Z,  - C/(C  - 1 + C)  ; Z»  - C/(C  + C)  . 

10  CO 


(5.5) 
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This  mcani  Chat  If  Che  ceic  gives  a large  number  of  complete  observations, 
relative  to  , Chen  the  Bayesian  and  MLE  estimators  coincide.  However, 
for  relatively  incomplete  tests,  more  weight  is  given  the  prior  expecta- 
tions, or  . Linear  mixing  formulae  of  this  type  are  well- 

known  In  Bayesian  forecasting  [2]  [3]  [4]. 

It  is  also  of  interest  to  examine  the  form  of  the  mixed  density  of 
lifetimes,  obtained  by  averaging  over  many  batches.  From  (3.1)  and  (3.1) 

-C 

P‘=(x)  - £P‘^(X  I 0)  - {1+ lQ(x)/Q^]}  ° ; 

-C  -1 

p(x)  - (C^q(x)  I Qq]U  + [Q(x)/Q^n  ° ; 

which  is  a generalization  of  a shifted  Pareto.  We  see  that  if  the  prototype 
failure  function  is  Gumbel,  we  get  exponential  tails  (for  large  x)  in  the 
mixed  distribution,  while  if  the  underlying  failures  are  Weibull,  we  get  the 
"more  dangerous"  algebraic  tails.  If  life  test  data  V are  drawn  from  a 
batch  with  fixed  6 - 0^^  , it  follows  that  the  forecast  distribution  for 
the  remaining  components  from  this  batch,  P^(x  | P)  , is  of  form  (5.6)  with 
updated  parameters  (5.2).  Furthermore,  one  can  show  that  if  the  test  data  set 
P is  large  enough,  P*^(x  | P)  approaches  exp{-0j^Q(x) } , the  true  dis- 
tribution for  this  batch,  with  probability  one. 

Generally,  the  failure  function  of  the  mixed  distribution  Increases 
less  rapidly  than  that  of  the  prototype  hazard: 

(5.7)  ^iixED^*^  “ ■ (Q(x)/Qo)l 

and  may  be  DFR  even  if  q(x)  is  Increasing.  (5.7)  should  not  be  confused 
with  the  average  behavior  of  the  individual  failure  functions: 


(5.8) 


R(x)  - ER(x  I ?)  - C Q(x)/Q„ 

0 o 


which  alwiyi  hai  the  ahapa  Q(x)  . 
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6.  TOTAL  Q ON  TEST  TRANSFORM 

We  turn  now  to  the  problem  of  Identifying  the  prototype  failure 
function,  Q(x)  , and  the  hyperparametera  let  ua  examine 

the  effect  of  plotting  TQT  with  a Q which  la  not  the  neceaaarlly  correct 
one. 

Paralleling  Barlow  and  Campo  [1],  we  define  the  total  Q on  teat 
tmnefom  (TQTT)  of  any  dlatrlbutlon  F with  respect  to  Q as: 

F“^(t) 

(6.1)  H"^(t)  - HpJ^Ct)  ~ J [1  - F(u)]dQ(u)  t e [0,1]  . 

0 

This  awkward  notation  Is  used  because  Hp^p  |the  inverse  of  ^p/qj 
a distribution  on  [0,y]  , where 

f"^(1) 

(6.2)  y - h’^1)  - J F‘^(u)dQ(u)  < » , 

0 

and  Is  the  mean  of  F If  Q(x)  - x , the  usual  total  time  on  test  transform. 
Now  if  F*^(x)  ■ exp{-0Q(x)}  , it  is  easy  to  verify  that 

(6.3)  H'^(t)  - t/e  , t e [0,1] 


while,  for  a general  failure  function,  f'^(x)  ■ exp{-Rp(x)} 


(6.4) 


dH'^t) 

dt 


X - F"^(t) 


rp(^ 


where  Tp(x)  ■ dRp(x)/dx  . 
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Thus,  if  we  could  plot  the  TQTT  for  any  Q , we  could  Isolate  those  areas 
where  r(x)  was  larger  [smaller]  than  q(x)  as  those  regions  where  the 
TQTT  has  slope  less  [greater]  than  unity.  Constants  of  proportionality 
can  be  eliminated  by  plotting  the  scaled  total  Q on  test  transform 
(STQTT) , H ^(t)/H  ^(1)  , thus  giving  us  a continuous  Increasing  function  on 
the  unit  square  which  Intersects  (0,0)  and  (1,1)  . And  the  cluster  Q(x) 
Is  to  the  form  of  R(x)  , the  closer  the  STQTT  is  to  the  straight  line 
(obtained  for  the  exponential  distribution  with  the  conventional  total  time 
on  test  transform). 

The  empirical  usefulness  of  this  for  Identification  can  be  seen  from 
the  following.  Suppose  we  have  observed  a censored  life  test  for  a sample 
of  N items  from  the  proportional  hazard  family  with  fixed,  but  unknown 
6 , say  F(x)  ■=  P(x  | 6)  , and  there  are  C failures.  The  empirical  life- 
time distribution,  F(|.(x)  , can  be  obtained  from  the  ordered  complete 
samples  •• 


0 < x < X 


1:N 


(6.5) 


Fc(*)“\i/C  ’'i:N  - - ^i+l:N 


^C:N  - * 


however  this  is  unlikely  to  be  of  direct  use  because  of  the  small  size  of  C . 
Suppose  we  plot  Instead  the  empirical  Q on  test  ratio 


(6.6) 


EQTR(i  I V) 


) 

) 


versus  (i  | C)  (1  «=  1,2,  ...,  C)  , where  TQT  is  the  statistic  defined  in 
(3.4).  Since 
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F"^(C/N) 

(6.7)  ^ >‘C:N^  " / U - Fj.(u)]dQ(u)  . 

0 

one  can  show,  following  [1],  chat 

(6.8)  EQTRd  I V)  (0  < t < u < 1) 

h‘^(u)  ■ ■ ■ 

uniformly  as  N -*■  * , and  l/N  -►  t , C/N  -♦  u . In  other  words,  the  empirical 
Q on  test  ratio  tends  to  look  like  the  scaled  total  Q on  test  transform 
as  the  number  of  data  points  increases,  and  thus  can  be  used  for  model 
Identification  merely  by  trying  different  Q functions,  until  a satisfactory 
straight  line  approximation  is  obtained.  Naturally,  for  any  finite  sample, 
there  will  be  fluctuations  in  the  ratio  (6.6);  Barlow  and  Campo  [1]  give 
results  for  the  exponential  case  using  total  time  on  test,  which  should 
also  be  useful  once  an  approximate  form  Q has  been  found. 

Perhaps  it  is  appropriate  at  this  point  to  compare  our  procedure  for 
nonexponential  model  identification  with  chat  set  forth  in  Barlow  and 
Campo,  assuming  always  there  is  sufficient  data.  In  their  approach,  they 
would  plot  empirical  ratios  corresponding  to  the  scaled  total  time  on  test 
(e.g.  (6.6)  and  (3.4)  with  Q(x)  ■ x) ; departure  from  the  exponential  would 
show  up  as  a departure  from  a straight  line.  Special  characteristics, 
such  as  IFR,  IFRA,  NBU,  etc.  would  be  apparent  from  inspection,  and  then 
through  the  use  of  families  of  transparent  overlays  on  the  unit  square, 
the  final  distribution  would  be  chosen  visually. 

In  our  approach,  we  propose  to  try  successive  forms  for  Q , plotting 
Che  empirical  ratio  (6.6)  each  time.  Regions  of  departure  from  a straight 
line,  together  with  (6.4),  are  supposed  to  suggest  new  empirical  modifica- 
tions in  Q , which  are  carried  out  until  Che  EQTR  is  apparently  a straight 


line. 
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7.  EXAMPLE 

Figures  1 through  5 show  various  empirical  Q on  test  ratios,  based 

on  107  samples  for  the  right  rear  brake  of  a D9G-66A  caterpillar  tractor 

given  by  Barlow  and  Campo  [1].  Three  prototype  failure  functions, 

115  2 

Q(x)  X , X * , and  x are  given  In  each  plot,  from  top  to  bottom 

respectively. 

Figures  1,  2,  and  3 illustrate  the  effect  of  varying  the  number  of 
complete  samples,  with  Figure  1 showing  the  original  107  lifetimes,  and  2 
and  3 showing  the  curves  with  only  54  and  27  lifetimes,  selected  at  random. 
Clearly  the  variability  Increases  with  decreasing  data,  although 
Q(x)  ■ x^'^  Is  always  the  best  of  the  functions  chosen.  Figure  1 suggests, 
via  (6.4),  that  a slight  modification  of  the  failure  function 
between  0-0.05  and  0.65-1.00  would  give  a better  fit. 

Figures  4 and  5 show  the  effect  of  Incomplete  life  data  by  censoring 
the  original  107  samples  at  the  54th  and  27th  lifetimes,  respectively. 

Again,  the  model  comparison  Is  remarkably  consistent,  even  with  small 
duration  tests,  although  the  discrimination  (separation  between  the  plots) 


Is  less  with  censored  data. 


^ * 


•j 

I 

f 

J 


t'. 


i 

) 


FIGURE  1:  EMPIRICAL  Q ON  TEST  RATIO.  107  COMPLETE  SAMPLES,  RIGHT  REAR 

BRAKE  [BARLOW  AND  CAMPOJ.  Q(x)  - , x^*^  , AND  x^  . 
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FIGURE  2:  EMPIRICAL  Q ON  TEST  RATIO.  54  COMPLETE  SAMPLES,  RIGHT  REAR 

BRAKE  [BARLOW  AND  CAMPO].  Q(x)  » , x^*^  , AND  x^  . 


FIGURE  3:  EMPIRICAL  Q ON  TEST  RATIO.  27  COMPLETE  SAMPLES,  RIGHT  REAR 
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BRAKE  [BARLOW  AND  CAMPO].  Q(x)  - x , x * , AND  x . 
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8.  DATA  FROM  SEVERAL  BATCHES;  ESTIMAIION  OF  HYPERPARAMETERS  AND  MODEL 
IDENTIFICATION 

The  use  of  (6.6)  to  Identify  Q depends  on  hsvlng  sufficient  complete 
lifetimes  for  a fixed  value  of  6 . However,  a more  usual  situation  would 
be  that  we  have  a moderate  number  of  samples  from  several  production  runs, 
each  with  different  values  of  6 . 

If  we  pool  the  data,  then  the  resulting  mixed  distribution  will  be 
given  by  (5.6),  from  which  we  see  that  It  Is  difficult  to  extricate  the 
identification  of  Q from  the  estimation  of  C and  Q . In  fact,  the 
TQTT  of  P(x)  with  respect  to  the  correct  Q(x)  Is  not  a straight  line, 
but  Is 


(8.1) 


H"^(t) 


C -1/C  ] 
o o I 


which  is  straight  only  for  C^  large.  A normalized  plot  of  this  function 
Is  shown  in  Figure  6. 

However,  maximum  likelihood  estimates  C^  , can  be  reliably  obtained 
from  a large  amount  M of  pooled  data,  given  Q . First,  is  found  so 

that  the  following  two  sums  are  equal. 


(8.2) 


Q(x^) 

Qo  + Q(x,) 


M 


I 

i»l 


1 


Q(x^) 


and  then: 


(8.3) 


There  remains  the  problem  of  identifying  Q . We  suggest  the  following: 


FIGURE  6:  NORMALIZED  TOTAL  Q ON  TEST  TRANSFORM  FOR  THl 

MIXED  PROPORTIONAL  HAZARD  FAMILY  WITH  THE  CORl 
Q(x)  , FOR  VALUES  OF  THE  HYPERPARAMETER  C - 
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(1)  For  each  value  of  d (production  run),  plot  the  empirical 
ratio  (6.5)  for  the  appropriate  number  of  completed  lifetimes; 

(2)  Connect  these  points  by  straight  lines,  or  other  reasonable 
curves ; 

(3)  For  various  values  of  the  abscissa,  find  the  average  value  of 
the  ordinate  over  all  runs. 

Even  though  the  Individual  curves  may  vary  because  of  the  few  samples 
per  batch,  the  average  STQTT  should  more  nearly  reflect  the  theoretical 
transform  curve.  One  then  varies  the  form  of  Q , as  before,  until  a 
satisfactory  fit  Is  obtained. 

Figure  7 shows  this  procedure  for  5 runs,  with  10  complete  samples  in 
each  run,  when  R(Q  ^(x))  ■ . The  raw  data  is  shown  unconnected,  the 

average  values  over  all  runs  for  t ■ 0(0. 1)1.0  are  connected  by  straight 
line  segments,  while  the  theoretical  STQTT  is  the  curvilinear  line.  Limited 
computational  experience  with  other  simulated  data  gives  similar  results, 
surprisingly  smooth  when  averaged  over  all  runs. 
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9.  EXTENSIONS 

A natural  extension  to  (3.1)  Is  to  consider  the  competing  hazard 

fami  ly 


(9.1) 


Here  the  , and  their  derivatives,  , are  prototype  failure  con- 
stituents which  are  competing  In  an  unknown  mixture  specified  by  parameters 

6,, 6.,  ...,  8 , subject  to  a known  m-dlmenslonal  prior  density.  For 

12  m 

instance,  with  m * 2 , one  constituent  might  be  random,  Qj^(x)  ■ 1 , and 
the  other  IFRA,  p2(x)/x  f In  x . Unfortunately,  the  likelihood  Is  now 
more  complex;  corresponding  to  (3.2),  we  have  for  m > 2 


N! 


N 


P^*l’*2 *N  ’ ^ ' ®1’®2^  " (N  -’o: 


(9.2) 


n (e^qj^(x^)  + 92q2(x^)lje 


-ej^TQ^T(P)-02TQ2T(P)] 


where  TQ^T  and  TQ2T  refer  to  the  two  possible  test  statistics.  In  place  of 
0 

6 , we  have  C terms  corresponding  to  the  possible  powers  of  6^  and 

62  • The  term  In  Inner  braces  is  proportional  to 


C 

I 


1-1 


Pi(t?)0102"^ 


where 
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(9.3)  p^(t?) 


K*  n 

permutations 


(C  - 1)  terms 


and  K Is  adjusted  so  that  P^((^)  " 1 • The  complexity  for  general  m 
is  easily  visualized. 

A convenient  prior  consists  of  independent  gammas: 


(9.4) 


P(O^,02 


^ol'^ol’^o2’^o2^ 


f C ,-l  -Q  ,0, 
0 e 1 


)( 


C --1  - 
a o2 
6 e 


Qo2®2 


): 


we  see  that  the  posterior-to-data  density  is  a mixture  of  C terms: 


(9.5)  p(0p62  I V) 


C 

p^(P)-p(e^,02  I C^j+i,Q^j+TQ^T(P);C^2+TQ2T(P)) 


from  which,  for  example,  the  updated  means  can  be  obtained,  e.g.. 


C C , + i 

(9.6)  E{0,  I P}  - I p.(P)  • 

i-1  C^^[E{0^}]  + TQj^T(P) 


Although  these  Bayesian  calculations  are  more  complicated  than  before,  they 
are  easily  computerized  for  m * 2 or  3 . 

Another  possible  extension  is  to  a oompeting  lifetime  model  family 


(9.7) 


m 

p‘^(x  I 0)  - [ ir  exp  [-0.Q.  (x)]  ; 

i-1 

m 

p(x  I 0)  - ^ IT  0 q (x)  exp  [-0.Q.  (x)]  . 

1-1  ^ 


1 , are  random  variables, 


Here  the  i 0 (1  ■ 1,2,  ...,  m)  , ^ tt  ■ 

j-1  J 

with,  say,  a Dirlchlet  prior  distribution,  and  the  0^  may  be  fixed  or 
subject  to  a Joint  or  Independent  prior.  What  happens.  In  general,  is 
that,  as  data  Is  received,  the  posterior  estimates  of  the  change 

(along  with  the  0^)  , thus  tending  to  select  the  "correct"  model. 
Unfortunately,  even  for  m - 2 , this  approach  leads  to  terms  in  which 
varying  numbers  of  the  are  summed,  eliminating  the  usefulness  of  the 

TQT  statistic. 

However,  It  is  possible  to  extend  the  class  of  priors  somewhat  by 
using  a competing  prior  model  family 

m 

(9.8)  p(0)  - \ Tf  p (6)  ; P (x  I 0)  - exp  {-0Q(x)}  ; 

1-1  ^ 

where  the  are  either  fixed,  or  have  their  own  prior.  Here  the  size 

of  the  computation  remains  fixed,  and  the  TQT  changes  the  weights 
as  a function  of  the  data.  For  Instance,  if  the  p^(0)  - p(6  | 
ari  gamma,  and  the  Initial  are  fixed,  then  posterior  to  the  data 

u’e  have  from  (3.2); 


(9.9) 


TT^(P) 


K*tt  , 


r(c^i  + c) 

^'<^ol> 


^ol 


+ TQT(P)] 


C ,+C 
ol 


where  K Is  chosen  so  that  tt^(P)  - 1 . Of  course,  this  approach 
requires  much  more  effort  in  estimating  the  hyperparameters. 
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